THE GROUND STATE ENERGY OF HEAVY ATOMS:
RELATIVISTIC LOWERING OF THE LEADING ENERGY CORRECTION



RUPERT L. FRANK, HEINZ SIEDENTOP, AND SIMONE WARZEL 

Abstract. We describe atoms by a pseudo-relativistic model that has its origin
in the work of Chandrasekhar. We prove that the leading energy correction
for heavy atoms, the Scott correction, exists. It turns out to be lower than in
the non-relativistic description of atoms. Our proof is valid up to and including
the critical coupling constant. It is based on a renormalization of the energy
whose zero level we adjust to be the ground-state energy of the corresponding
non-relativistic problem. This allows us to roll the proof back, by relatively
simple technical means, to results for the Schrödinger operator.



1. Introduction 

The energy of heavy atoms has attracted considerable interest that dates back to
the advent of quantum mechanics. As in classical mechanics, it soon became clear
that the exact solution of problems involving more than two particles interacting
through Coulomb forces is not possible. Thomas [60] and Fermi [22, 23] introduced
their description of such atoms by the particle density, and Lenz [31], who wrote down
the corresponding energy functional which we will use here (see (7)), addressed this
question and derived that the ground state energy of atoms should decrease with
the atomic number $Z$ as $Z^{7/3}$. Scott predicted that this could be refined by an
additive $Z^2$-correction. Considerably later Schwinger [46] argued also for Scott's
prediction. Schwinger [47] and Englert and Schwinger [10, 11, 12] even refined these
considerations by adding more lower order terms (see also Englert [9]).

Attempts to address the underlying question, whether the predicted formulae
would yield asymptotically correct results when compared with the $N$-particle
Schrödinger theory, were for a long time unsuccessful. It was Lieb and
Simon who proved in their seminal paper [36] that the prediction of Thomas,
Fermi, and Lenz is indeed asymptotically correct. Alternative proofs were given
by Thirring [59] (lower bound), Lieb [33], and Balodis and Solovej [40]. The Scott
correction was established by Hughes [26, 27] (lower bound), and Siedentop and
Weikard [48, 49, 50, 51, 52] (lower and upper bound). In fact, even the existence
of the $Z^{5/3}$-correction conjectured by Schwinger was proven (Fefferman and Seco
[18, 19, 20, 13, 21, 16, 14, 15, 17]). Later these results were extended in various
ways, e.g., the Scott correction to ions (Bach [1, 2]), to molecules (Ivrii and Sigal
[29], Solovej and Spitzer [58, 57], Balodis [4]), and to molecules in the presence
of magnetic fields (Sobolev [55] and Ivrii [30]). Ivrii [28] extended the validity of
Schwinger's correction to the molecular case.

Nevertheless, from a physical point of view, these considerations are questionable,
since large atoms force the bulk of the electrons onto orbits that are close to the
nucleus (of order $Z^{-1/3}$), where the electrons move with high speed, which requires
a relativistic treatment. Schwinger [47] has estimated this effect, concluding that
it should contribute to the Scott correction whereas the leading term should be
unaffected by the change of model. Sørensen [44] was the first to prove that the
Thomas-Fermi term is indeed left unaffected when the non-relativistic Hamiltonian
is replaced by the Chandrasekhar operator in the limit of large $Z$ and large velocity
of light $c$. Cassanas and Siedentop [5] showed that, similarly to the Chandrasekhar
case, the leading energy is not affected for the Brown-Ravenhall operator.

Recently, Solovej, Sørensen, and Spitzer [56] announced a proof that a correction
is at most of the order $Z^2$, although no claim on the actual value of the coefficient was
made. (See also Sørensen [43] for the non-interacting case.) In the present paper, we
give an alternative proof of the Scott correction for the Chandrasekhar operator, which
we present, for simplicity, in the atomic case. Our proof relies heavily on semiclassical
approximations for electrons that are far enough from the nucleus. However,
we use them only indirectly, relying on known results about the non-relativistic
Scott correction. In addition we use only relatively standard technical means such as
Lieb-Thirring and Hardy inequalities. Our basic strategy is a renormalization of
the energy, setting the energy of the Schrödinger atom as zero. Moreover, we are
able to extend the result of [56] to the case of the critical coupling constant. In
view of the corresponding situation for the Dirac operator (see Remark (3) after
Theorem 1.1), this is a subtle and not at all obvious observation.

However, the question of whether the Schwinger correction, which lives on the
scale $Z^{-2/3}$, also exists in this relativistic model and, if so, what its value is,
cannot be answered with our techniques and is, therefore, left open.

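The energy of a heavy atom is described by a quadratic form

(1)   $\mathcal{E}^{\#}(\psi) := \Big\langle \psi, \Big( \sum_{\nu=1}^{N} \big( T^{\#}_{\nu} - Z|x_\nu|^{-1} \big) + \sum_{1\le\mu<\nu\le N} |x_\mu - x_\nu|^{-1} \Big) \psi \Big\rangle$

with

(2)   $Q_N := \bigwedge_{\nu=1}^{N} \big( C_0^\infty(\mathbb{R}^3) \otimes \mathbb{C}^q \big)$.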

The superscript $\#$ refers to the following two operators, which are self-adjointly
realized in $L^2(\mathbb{R}^3) \otimes \mathbb{C}^q$:

Chandrasekhar operator:  $T^C := \sqrt{c^2p^2 + c^4} - c^2$
Schrödinger operator:    $T^S := \tfrac12 p^2$.

The parameter $q \in \mathbb{N}$ represents the possible number of spin states per electron,
which physically has the value 2; $Z$ is the atomic number, $c$ is the velocity of
light, $N$ is the electron number. We use units in which $m = e^2 = \hbar = 1$.

A word about names: we address operators of the form T c + V with a potential 
V as Chandrasekhar operators, since the use of this kinetic energy can be traced 
back at least to Chandrasekhar's semiclassical treatment of the stability of stars 
[6] where it can be viewed as the underlying operator. Later the use of T c has 
been investigated by Weder [61] and by Herbst [25]. In the literature the operator 
is sometimes addressed as pseudo-relativistic operator or Herbst operator. 

In the following we assume that the system is neutral, i.e., $Z = N$, an assumption
that we make mainly for notational convenience. It follows from Kato's
inequality (with sharp constant), $(2/\pi)|x|^{-1} \le |p|$, that the Chandrasekhar form
$\mathcal{E}^C$ is bounded from below if and only if

(3)   $\kappa := Z/c \le 2/\pi$.

Henceforth we assume this condition.
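To give condition (3) a concrete scale, the following Python sketch evaluates $\kappa = Z/c$ for a physical value of $c$. In the units $m = e^2 = \hbar = 1$ the velocity of light equals the inverse fine-structure constant; the numerical value 137.035999 is an input assumed here, not taken from the text, and the function names are purely illustrative.

```python
import math

# Velocity of light in units m = e^2 = hbar = 1, i.e. the inverse
# fine-structure constant. Assumed numerical input, not from the paper.
C_PHYSICAL = 137.035999


def kappa(Z, c=C_PHYSICAL):
    """Coupling constant kappa = Z / c from condition (3)."""
    return Z / c


def is_bounded_below(Z, c=C_PHYSICAL):
    """True if kappa <= 2/pi, i.e. the Chandrasekhar form is bounded below."""
    return kappa(Z, c) <= 2.0 / math.pi


def largest_admissible_Z(c=C_PHYSICAL):
    """Largest integer Z with Z / c <= 2/pi."""
    return math.floor(2.0 * c / math.pi)
```

With this value of $c$ the critical coupling $2/\pi$ corresponds to an atomic number between 87 and 88, which is why the critical case is physically relevant for heavy atoms.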

The ground state energy of a heavy atom with atomic number $Z$ is given by

(4)   $E^{\#}_{(\kappa)}(Z) := \inf\{ \mathcal{E}^{\#}(\psi) \mid \psi \in Q_N,\ \|\psi\| = 1 \}$

where $\#$ refers, as above, either to the Chandrasekhar or the Schrödinger operator,
the former being dependent additionally on $\kappa$. We are interested in $E^C_\kappa(Z)$.
However, $E^S(Z)$ will also play an essential role, namely in regularizing the energy.
Note that $\mathcal{E}^C \le \mathcal{E}^S$, which implies that $E^C_\kappa(Z) \le E^S(Z)$. Our main result strengthens
a result by Solovej, Sørensen, and Spitzer [56] in the atomic case to the critical
value of the coupling constant.

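Theorem 1.1. Let $\kappa \in (0, 2/\pi]$ and $q \in \mathbb{N}$. In the limit $Z \to \infty$ with $\kappa = Z/c$ fixed
and $N = Z$,

(5)   $E^C_\kappa(Z) = E^S(Z) - q\, s(\kappa)\, Z^2 + o(Z^2)$

where

(6)   $s(\kappa) := \kappa^{-2} \operatorname{tr}\Big[ \big( \sqrt{p^2+1} - 1 - \kappa|x|^{-1} \big)_- - \big( \tfrac12 p^2 - \kappa|x|^{-1} \big)_- \Big]$.

In (6) we used the notation $A_- := -A\chi_{(-\infty,0)}(A)$ for the negative part of a
self-adjoint operator $A$.
Several remarks apply: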

(1) As already mentioned in the introduction, the asymptotics of the ground-state
energy $E^S(Z)$ of the Schrödinger atom up to $o(Z^2)$ is given by the
Thomas-Fermi energy and the Scott correction. To state this result precisely
we introduce the Thomas-Fermi functional (Lenz [31])

(7)   $\mathcal{E}_{TF}(\rho) := \int_{\mathbb{R}^3} \Big[ \tfrac35 \gamma_{TF}\, \rho(x)^{5/3} - \frac{Z}{|x|}\rho(x) \Big]\, dx + D(\rho,\rho)$

where, in our units, $\gamma_{TF} = (6\pi^2/q)^{2/3}/2$ and where

      $D(\rho,\sigma) := \frac12 \int_{\mathbb{R}^3}\int_{\mathbb{R}^3} \frac{\rho(x)\sigma(y)}{|x-y|}\, dx\, dy$

is the Coulomb scalar product. We define

(8)   $E_{TF}(Z) := \inf\{ \mathcal{E}_{TF}(\rho) \mid \rho \in L^1(\mathbb{R}^3) \cap L^{5/3}(\mathbb{R}^3),\ \rho \ge 0 \}$

to be the minimal Thomas-Fermi energy. By scaling one finds that

      $E_{TF}(Z) = E_{TF}(1)\, Z^{7/3}$.

The asymptotic formula

(9)   $E^S(Z) = E_{TF}(Z) + \tfrac14 q Z^2 + O(Z^{47/24})$

was proven in [48, 49]; for a lower bound only, see Hughes [26, 27]. Inserting
this into (5) one finds that

(10)  $E^C_\kappa(Z) = E_{TF}(Z) + \big( \tfrac14 - s(\kappa) \big) q Z^2 + o(Z^2)$.

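(2) The spectral shift $s(\kappa)$ is monotone increasing with respect to $\kappa$ and strictly
positive for $\kappa > 0$. Indeed, by scaling $x \mapsto x/\kappa$,

      $s(\kappa) = \operatorname{tr}\Big[ \big( \sqrt{\kappa^{-2}p^2 + \kappa^{-4}} - \kappa^{-2} - |x|^{-1} \big)_- - \big( \tfrac12 p^2 - |x|^{-1} \big)_- \Big]$

and $\sqrt{\kappa^{-2}p^2 + \kappa^{-4}} - \kappa^{-2}$ is monotone decreasing with respect to $\kappa$.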
(3) It is part of our assertion that the operator in brackets in (6) belongs to
the trace class. In the subcritical case $\kappa < 2/\pi$ this was already proved by
Sørensen [43]. The finiteness of $s(2/\pi)$ should not be taken for granted: in
fact, when substituting the Dirac operator for the Chandrasekhar operator in
(6) it was shown numerically that the corresponding spectral shift diverges
at the critical coupling [43].

Since neither the Schrödinger nor the Chandrasekhar operator depends explicitly
on spin, we shall assume henceforth $q = 1$; the general case follows along the same
lines. We prove Theorem 1.1 in Section 3 after having established a precise bound
on the spectral shift for one-particle operators in the next section.
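The monotonicity used in Remark (2) can be checked numerically on the level of symbols. The Python sketch below evaluates $\sqrt{\kappa^{-2}p^2 + \kappa^{-4}} - \kappa^{-2}$ for a scalar momentum $p$; it is only an illustration of the scalar inequality, not of the operator statement, and the function name is hypothetical.

```python
import math


def scaled_kinetic_symbol(p, kappa):
    """sqrt(p^2/kappa^2 + 1/kappa^4) - 1/kappa^2 for a momentum magnitude p.

    Written as a p^2 / (sqrt(a p^2 + a^2) + a) with a = kappa^(-2), which is
    algebraically the same expression but avoids cancellation for small kappa.
    """
    a = kappa ** -2
    return (a * p * p) / (math.sqrt(a * p * p + a * a) + a)
```

For fixed $p$ the value decreases as $\kappa$ grows and tends to the Schrödinger symbol $p^2/2$ as $\kappa \to 0$, in line with the ordering $\mathcal{E}^C \le \mathcal{E}^S$.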



2. Bound on the Spectral Shift 

For any real-valued potential $v$ for which the following operators can be defined
according to Friedrichs, we set

(11)   $S(v) := \tfrac12 p^2 - v$,

(12)   $C(v) := \sqrt{p^2+1} - 1 - v$,

the Schrödinger and the Chandrasekhar operator, respectively, in $L^2(\mathbb{R}^3)$. We assume $c = 1$
throughout this section.

If the potential $v$ is radially symmetric, both the Schrödinger and the Chandrasekhar
operator commute with the angular momentum operators, allowing for
a decomposition into the corresponding invariant subspaces. For each $l \in \mathbb{N}_0$ the
subspace $\mathfrak{H}_l$ corresponding to the spherical harmonics $Y_{l,m}$ with $m = -l, \dots, l$ is an
invariant subspace of $S(v)$ and $C(v)$, and $\bigoplus_{l=0}^{\infty} \mathfrak{H}_l = L^2(\mathbb{R}^3)$. We write $\Lambda_l$ for the
orthogonal projection onto $\mathfrak{H}_l$ and

(13)   $\operatorname{tr}_l(A) := \operatorname{tr}(\Lambda_l A)$

for the corresponding reduced trace.

Our main result in this section concerns the decay of the spectral shift

      $\operatorname{tr}_l\big( [C(v)]_- - [S(v)]_- \big)$

as the angular momentum $l$ increases. We shall prove

Theorem 2.1. There exists a constant $M$ such that for all $\mu \ge 0$, for all $l \in \mathbb{N}_0$,
and for all $v : [0,\infty) \to [0,\infty)$ satisfying

(14)   $v(r) \le \tfrac{2}{\pi}\, r^{-1}$

the sum of eigenvalue differences for angular momentum $l$ is bounded according to

(15)   $0 \le \operatorname{tr}_l\big( [C(v)+\mu]_- - [S(v)+\mu]_- \big) \le M (l+1)^{-2}$.

This theorem shows that there is an effective cancellation in the difference in
(15). Indeed, if $v(r) = \kappa r^{-1}$, then

      $\operatorname{tr}_l [S(\kappa r^{-1})]_- = (2l+1)\, \frac{\kappa^2}{2} \sum_{n=1}^{\infty} \frac{1}{(n+l)^2}$

and this does not decay at all as $l \to \infty$. We note also that (15) implies that the
operator $\big(\sqrt{p^2+1} - 1 - \kappa|x|^{-1}\big)_- - \big(\tfrac12 p^2 - \kappa|x|^{-1}\big)_-$ appearing in Theorem 1.1 is
trace class for any $\kappa \in (0, \tfrac{2}{\pi}]$.
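The claim that the Schrödinger partial trace alone does not decay in $l$ is easy to check numerically. The sketch below uses the elementary identity $\sum_{n\ge1} (n+l)^{-2} = \pi^2/6 - \sum_{k=1}^{l} k^{-2}$ to evaluate $(2l+1)\frac{\kappa^2}{2}\sum_{n\ge1}(n+l)^{-2}$; the function names are illustrative and not taken from the text.

```python
import math


def shifted_inverse_square_sum(l):
    """sum_{n>=1} 1/(n+l)^2, computed as pi^2/6 - sum_{k=1}^{l} 1/k^2."""
    return math.pi ** 2 / 6 - sum(1.0 / (k * k) for k in range(1, l + 1))


def schroedinger_partial_trace(l, kappa):
    """(2l+1) * kappa^2/2 * sum_{n>=1} 1/(n+l)^2, the hydrogenic value of
    tr_l [S(kappa/r)]_- for angular momentum l."""
    return (2 * l + 1) * kappa ** 2 / 2 * shifted_inverse_square_sum(l)
```

The values stay between roughly $0.82\,\kappa^2$ (at $l = 0$) and $\kappa^2$ (as $l \to \infty$), so the $(l+1)^{-2}$ decay in (15) comes entirely from the cancellation between the two traces.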

2.1. Reminder on Lieb-Thirring Estimates. In the proof of Theorem 2.1 we
use the following relativistic Lieb-Thirring inequalities due to Daubechies [7].

Proposition 2.2. For any $\gamma > \tfrac12$ there exists a constant $L_\gamma$ such that for all $l \ge 0$

(16)   $\operatorname{tr}_l [C(v)]_-^{\gamma} \le L_\gamma\, (2l+1) \int_0^\infty \big( [v(r)]^{\gamma+1/2} + [v(r)]^{1+\gamma} \big)\, dr$.

Proposition 2.2 is also valid for $\gamma = \tfrac12$, but we will not need this fact.

Proof. Since $\operatorname{tr}_l [C(v)]_-^{\gamma} \le (2l+1) \operatorname{tr}_0 [C(v)]_-^{\gamma}$, it suffices to verify the claim for $l = 0$.
If we extend $v$ to an even function $\tilde v$ on $\mathbb{R}$, then $C(v)$ is unitarily equivalent to the
part of the whole-line operator $\sqrt{p^2+1} - 1 - \tilde v$ on antisymmetric functions. In the
whole-line case, the result follows by evaluating the integral in [7, Eq. (2.14)]. □
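For a truncated Coulomb potential $v(r) = g\, r^{-1}\chi_{\{r>R\}}(r)$ the integral on the right-hand side of (16) can be evaluated in closed form. The Python sketch below does this for $R = l^2/4$ and $g = 10$, which is the potential $w_l$ appearing in Lemma 2.7, and omits the unspecified constant $L_\gamma$. It is meant as a plausibility check of the $l$-dependence used in (35); the function name and defaults are illustrative assumptions.

```python
def lieb_thirring_integral(l, gamma, coupling=10.0):
    """(2l+1) * int_R^inf [(g/r)^(gamma+1/2) + (g/r)^(gamma+1)] dr with R = l^2/4.

    This is the right-hand side of (16) without the constant L_gamma, for the
    truncated Coulomb potential g/r on r > l^2/4. Requires l >= 1, gamma > 1/2.
    """
    if l < 1 or gamma <= 0.5:
        raise ValueError("need l >= 1 and gamma > 1/2")
    R = l * l / 4.0
    a = gamma + 0.5   # first exponent, a > 1
    b = gamma + 1.0   # second exponent, b > 1
    # int_R^inf r^(-s) dr = R^(1-s) / (s-1) for s > 1
    first = coupling ** a * R ** (1.0 - a) / (a - 1.0)
    second = coupling ** b * R ** (1.0 - b) / (b - 1.0)
    return (2 * l + 1) * (first + second)
```

For $\gamma = 2$ the result behaves like $l^{-2}$ and for $\gamma = 1$ it stays bounded in $l$, matching the two bounds stated in (35).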

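Our treatment of the critical case $\kappa = \tfrac{2}{\pi}$ is based on the following inequality [37,
Theorem 11] of Lieb and Yau.

Proposition 2.3. Let $\iota$ be a function with support in $\{x \in \mathbb{R}^3 : |x| \le 1\}$. Then
for all $\mu > 0$

      $\operatorname{tr}\big[ \bar\iota\, \big( |p| - \tfrac{2}{\pi}|x|^{-1} - \mu \big)\, \iota \big]_- \le \mathrm{const}\; \mu^4 \int |\iota(x)|^2\, dx$.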

2.2. Finiteness of Partial Traces. In (15) appears the trace of the difference
of the operators $[C(v)+\mu]_-$ and $[S(v)+\mu]_-$. We begin by proving that both
operators separately have finite traces. Since $S(v) \ge C(v)$ (see also (25) below) it
suffices to prove this in the relativistic case.

Lemma 2.4. For all $l \in \mathbb{N}_0$ one has $\operatorname{tr}_l [C(\tfrac{2}{\pi}|x|^{-1})]_- < \infty$.

Proof. Pick a Lipschitz function $\varphi : \mathbb{R}_+ \to [0,\pi/2]$ with Lipschitz constant $\pi$
which vanishes for $r \le 1/2$ and which is $\pi/2$ for $r \ge 1$. Then $\iota := \cos(\varphi(|x|))$ has
compact support around the origin and, furthermore, it constitutes together with
$\lambda := \sin(\varphi(|x|))$ a quadratic partition of unity, i.e., $\iota^2 + \lambda^2 = 1$. According to Lieb and
Yau [37, Theorem 9] we have the localization formula

(17)   $(\psi, (p^2+1)^{1/2}\psi) = (\iota\psi, (p^2+1)^{1/2}\iota\psi) + (\lambda\psi, (p^2+1)^{1/2}\lambda\psi) - (\psi, L\psi)$

for $\psi \in L^2(\mathbb{R}^3)$. Here $L$ is the bounded integral operator on $L^2(\mathbb{R}^3)$ with non-negative
kernel given in terms of a Bessel function,

(18)   $L(x,y) := \frac{1}{\pi^2}\, \frac{K_2(|x-y|)}{|x-y|^2}\, \sin^2\big( (\varphi(|x|) - \varphi(|y|))/2 \big)$.

We shall estimate this localization error by a multiplication operator. More precisely,
we shall show that there exists a constant $M > 0$ such that

(19)   $(\psi, L\psi) \le M\, (\psi, e^{-|x|}\psi)$.

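To prove this, we note that by the Schwarz inequality and the symmetry of the kernel
we have $(\psi, L\psi) \le \int dx\, |\psi(x)|^2 \int dy\, L(x,y)$. The kernel vanishes unless $|x| < 1$ or $|y| < 1$,
and $\sin^2\big((\varphi(|x|)-\varphi(|y|))/2\big) \le (\pi/2)^2 |x-y|^2$ by the Lipschitz property of $\varphi$. Hence

(20)   $(\psi, L\psi) \le \frac{1}{\pi^2} \int dx\, |\psi(x)|^2 \int dy\, \frac{K_2(|x-y|)}{|x-y|^2}\, \sin^2\big( (\varphi(|x|) - \varphi(|y|))/2 \big)$
       $\le \frac14 \int_{|x|<1} dx\, |\psi(x)|^2 \int_{\mathbb{R}^3} dy\, K_2(|x-y|) + \frac14 \int_{|x|>1} dx\, |\psi(x)|^2 \int_{|y|<1} dy\, K_2(|x-y|)$
       $=: (\psi, W_\iota \psi) + (\psi, V_\lambda \psi)$.

Since $\int_0^\infty dr\, r^2 K_2(r) = 3\pi/2$ [38, Formula 11.4.22] we have

      $W_\iota(x) := \chi_{\{|x|<1\}}(x)\, \frac14 \int_{\mathbb{R}^3} dy\, K_2(|x-y|) = \frac{3\pi^2}{2}\, \chi_{\{|x|<1\}}(x)$.

Moreover, since $K_2(r) = 2/r^2 + O(1)$ as $r \downarrow 0$, and $K_2(r) \sim \sqrt{\pi/(2r)}\, \exp(-r)$ as
$r \to \infty$ [42], the function

      $V_\lambda(x) := \chi_{\{|x|>1\}}(x)\, \frac14 \int_{|y|<1} dy\, K_2(|x-y|)$

is well-defined and satisfies $V_\lambda(x) \le \mathrm{const}\; e^{-|x|}$. This proves (19).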
Combining (17) and (19) we find that

(21)   $\operatorname{tr}_l [C(\tfrac{2}{\pi}|x|^{-1})]_- \le \operatorname{tr}_l \big[ \iota\, C(\tfrac{2}{\pi}|x|^{-1} + M e^{-|x|})\, \iota \big]_- + \operatorname{tr}_l \big[ \lambda\, C(\tfrac{2}{\pi}|x|^{-1} + M e^{-|x|})\, \lambda \big]_-$.

To estimate the inner part we use that

      $\iota\, C(\tfrac{2}{\pi}|x|^{-1} + M e^{-|x|})\, \iota \ge \iota\, \big( |p| - \tfrac{2}{\pi}|x|^{-1} - 1 - M \big)\, \iota$.

It follows therefore from Proposition 2.3 that the corresponding trace is finite (even
when summed over all $l$). For the outer part we use

      $\lambda\, C(\tfrac{2}{\pi}|x|^{-1} + M e^{-|x|})\, \lambda \ge C\big( \chi_{\{|x| \ge 1/2\}} \tfrac{2}{\pi}|x|^{-1} + M e^{-|x|} \big)$.

The corresponding trace is finite by Proposition 2.2. □

2.3. Angular Momentum Barrier Inequalities. A straightforward consequence
of Hardy's inequality, which we will frequently exploit, is

Lemma 2.5. Let $l \in \mathbb{N}_0$. Then, as operators in $\mathfrak{H}_l$,

(22)   $p^2 \ge (l + \tfrac12)^2\, r^{-2}$.

Proof. Writing the Laplacian in spherical coordinates we find that $p^2$ in $\mathfrak{H}_l$ is unitarily
equivalent to $p_r^2 + l(l+1) r^{-2}$ in $L^2(\mathbb{R}_+)$. The claim hence follows from Hardy's
inequality, $p_r^2 \ge (2r)^{-2}$. □

By operator monotonicity of the square root, (22) implies the (non-sharp) inequality
$|p| \ge (l + \tfrac12) r^{-1}$ in $\mathfrak{H}_l$. We shall need an analogue of this inequality for
the operator $C(0)$ instead of $|p|$. Note that $\sqrt{p^2+1} - 1$ behaves as $\tfrac12 p^2$ for small
$p$. Since 'small $p$' corresponds intuitively to 'large $r$', we cannot expect that $C(0)$
controls an $r^{-1}$ decay. But it does control an $r^{-1}$ singularity. This is the content
of

Lemma 2.6. Let $l \in \mathbb{N}_0$, $R > 0$ and $M_l(R) := (l+\tfrac12)^2 / \big( R + \sqrt{R^2 + (l+\tfrac12)^2} \big)$.
Then, as operators in $\mathfrak{H}_l$,

(23)   $C(0) \ge M_l(R)\, \chi_{\{r \le R\}}(r)\, r^{-1}$.

Proof. The inequality (22) and operator monotonicity of the square root imply in $\mathfrak{H}_l$

      $\sqrt{p^2+1} - 1 \ge \sqrt{(l+\tfrac12)^2 r^{-2} + 1} - 1$;

the claim follows by determining the solution $M$ of the equation

      $\sqrt{(l+\tfrac12)^2 r^{-2} + 1} = 1 + M r^{-1}$

at $r = R$ and noting that this solution is decreasing in $R$.

□
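The constants in Lemma 2.6 can be checked numerically. The following Python sketch evaluates $M_l(R)$, compares it with the scalar function $r\,(\sqrt{(l+\tfrac12)^2 r^{-2}+1}-1)$ on $r \le R$, and also evaluates the radius $R_l = ((l+\tfrac12)^2 - 4\kappa^2)/(4\kappa)$ that is used later in the proof of Lemma 3.5. The function names are illustrative; the check concerns only the scalar inequality behind (23).

```python
import math


def barrier_constant(l, R):
    """M_l(R) = (l+1/2)^2 / (R + sqrt(R^2 + (l+1/2)^2)) from Lemma 2.6."""
    a2 = (l + 0.5) ** 2
    return a2 / (R + math.sqrt(R * R + a2))


def radial_symbol_times_r(l, r):
    """r * (sqrt((l+1/2)^2 / r^2 + 1) - 1), which (23) bounds below by M_l(R)
    for 0 < r <= R."""
    a2 = (l + 0.5) ** 2
    return r * (math.sqrt(a2 / (r * r) + 1.0) - 1.0)


def critical_radius(l, kappa):
    """R_l = ((l+1/2)^2 - 4 kappa^2) / (4 kappa); solves M_l(R_l) = 2 kappa."""
    return ((l + 0.5) ** 2 - 4.0 * kappa ** 2) / (4.0 * kappa)
```

In particular $M_l(l^2/4) \ge 5/4$ for $l \ge 1$ (the smallest value, about 1.27, occurs at $l = 1$), and $M_l(R_l) = 2\kappa$, which is what makes the choice of $R_l$ in Lemma 3.5 natural.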

The core of Theorem 2.1 is contained in the following

Lemma 2.7. There exists a constant such that for all $v : [0,\infty) \to [0,\infty)$ satisfying
(14), for all $\mu \ge 0$ and for all $l \in \mathbb{N}$ one has

(24)   $0 \le \operatorname{tr}_l\big( [C(v)+\mu]_- - [S(v)+\mu]_- \big) \le \mathrm{const}\, \big( \operatorname{tr}_l [C(w_l)]_-^2 + (l+\tfrac12)^{-2} \operatorname{tr}_l [C(w_l)]_- \big)$

where $w_l(r) := 10\, r^{-1} \chi_{\{r > l^2/4\}}(r)$.

Proof. The identity

(25)   $\tfrac12 p^2 = C(0) + \tfrac12 C(0)^2$

implies the non-negativity asserted in (24).
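Identity (25) is an algebraic identity between functions of $p$: with $t = \sqrt{p^2+1}-1$ one has $(t+1)^2 = p^2+1$, hence $p^2/2 = t + t^2/2$. The short Python sketch below checks this for scalar $p$; the function names are illustrative only.

```python
import math


def chandrasekhar_symbol(p):
    """Symbol of C(0) = sqrt(p^2 + 1) - 1 (with c = 1), written in the
    equivalent form p^2 / (sqrt(p^2 + 1) + 1) to avoid cancellation."""
    return p * p / (math.sqrt(p * p + 1.0) + 1.0)


def schroedinger_symbol(p):
    """Symbol of S(0) = p^2 / 2."""
    return 0.5 * p * p


def identity_25_defect(p):
    """p^2/2 - (t + t^2/2) with t the Chandrasekhar symbol; zero up to rounding."""
    t = chandrasekhar_symbol(p)
    return schroedinger_symbol(p) - (t + 0.5 * t * t)
```

Since $t \ge 0$, the identity also shows directly that the Chandrasekhar symbol never exceeds the Schrödinger one.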

To prove the second inequality in (24) we shall first assume (in addition to (14))
that $v$ is a bounded function and that $\mu > 0$. Once the inequality is proved in this
case (with a constant independent of $\mu$ and the supremum of $v$), we can apply it
to the cut-off potential $v_M := \min\{v, M\}$.

By monotone convergence $C(v_M)$ and $S(v_M)$ converge to $C(v)$ and $S(v)$ in strong
resolvent sense [8, Thm. 1.2.3], and therefore [45, Thm. VIII.20], [53, Thm. 2.7]
for any $\mu > 0$, $\liminf_{M\to\infty} \operatorname{tr}_l [C(v_M)+\mu]_- \ge \operatorname{tr}_l [C(v)+\mu]_-$, and similarly for
$S(v_M)$. But the reverse inequalities are also true, since $C(v_M) \ge C(v)$ and $S(v_M) \ge S(v)$.
Hence we conclude that $\operatorname{tr}_l\big( [C(v_M)+\mu]_- - [S(v_M)+\mu]_- \big)$ converges to the
corresponding quantity with $v_M$ replaced by $v$. Finally, we can use Lemma 2.4 to
extend the result to $\mu \to 0$.

Thus we may assume $v$ to be bounded and $\mu > 0$, and denote by $\gamma_l$ the orthogonal
projection onto the eigenspace of $C(v)$ corresponding to angular momentum $l$ and
eigenvalues less than or equal to $-\mu$. Since $v$ is bounded, any eigenfunction of $C(v)$
lies in the form domain of $S(v)$. Hence the variational principle together with (25)
yields

(26)   $2 \operatorname{tr}_l\big( [C(v)+\mu]_- - [S(v)+\mu]_- \big) \le \operatorname{tr}_l [C(0)^2 \gamma_l]$.

Again the boundedness of $v$ and the finite rank of $\gamma_l$ imply that $\operatorname{tr}_l [C(0)^2 \gamma_l]$ is finite.
Using the eigenvalue equation and the bound (14) on the potential we estimate this
term further as follows:

(27)   $\operatorname{tr}_l [C(0)^2 \gamma_l] \le \operatorname{tr}_l [C(v)]_-^2 + \operatorname{tr}_l [v^2 \gamma_l] \le \operatorname{tr}_l [C(\tfrac{2}{\pi}|x|^{-1})]_-^2 + (\tfrac{2}{\pi})^2 \operatorname{tr}_l [|x|^{-2} \gamma_l]$.

The last term in the above inequality is bounded using (22) and (25),

(28)   $\| |x|^{-1} \psi_l \|^2 \le (l+\tfrac12)^{-2} \| |p| \psi_l \|^2 = (l+\tfrac12)^{-2} \big( \|C(0)\psi_l\|^2 + 2 (\psi_l, C(0)\psi_l) \big)$

valid for $\psi_l \in \mathfrak{H}_l$. Since $l \ge 1$ we have $(\tfrac{2}{\pi})^2 (l+\tfrac12)^{-2} \le \tfrac12$ and thus the last two
estimates may be summarized as

(29)   $\operatorname{tr}_l [C(0)^2 \gamma_l] \le 2 \operatorname{tr}_l [C(\tfrac{2}{\pi}|x|^{-1})]_-^2 + 4 (\tfrac{2}{\pi})^2 (l+\tfrac12)^{-2} \operatorname{tr}_l [C(0) \gamma_l]$.
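In view of (26) the assertion will follow if we can prove

(30)   $\operatorname{tr}_l [C(\tfrac{2}{\pi}|x|^{-1})]_-^2 \le \operatorname{tr}_l [C(w_l)]_-^2$,   $\operatorname{tr}_l [C(0)\gamma_l] \le \operatorname{tr}_l [C(w_l)]_-$.

We begin with the (more difficult) second inequality. We have

(31)   $\operatorname{tr}_l [C(0)\gamma_l] \le \operatorname{tr}_l [v \gamma_l] \le \tfrac{2}{\pi} \operatorname{tr}_l [|x|^{-1}\gamma_l] \le \operatorname{tr}_l [|x|^{-1}\gamma_l]$.

We apply Lemma 2.6 with $R = l^2/4$ to bound the last term. Since $M_l(l^2/4) \ge 5/4$
for $l \ge 1$ we obtain

(32)   $(\psi_l, |x|^{-1}\psi_l) \le \tfrac45 (\psi_l, C(0)\psi_l) + (\psi_l, \chi_{\{r > l^2/4\}}(|x|)\, |x|^{-1}\psi_l)$.

The last two estimates can be summarized as

(33)   $\operatorname{tr}_l [C(0)\gamma_l] \le -\operatorname{tr}_l [C(w_l)\gamma_l] \le \operatorname{tr}_l [C(w_l)]_-$,

which proves the second inequality in (30). We proceed similarly to prove the first
one. Indeed, by (32)

(34)   $C(\tfrac{2}{\pi}|x|^{-1}) \ge \tfrac15 C(0) + \chi_{\{r \le l^2/4\}}(|x|)\, |x|^{-1} - \tfrac{2}{\pi}|x|^{-1} \ge \tfrac15 C(w_l)$

and hence $\operatorname{tr}_l [C(\tfrac{2}{\pi}|x|^{-1})]_-^2 \le \operatorname{tr}_l [C(w_l)]_-^2$. This completes the proof of the
lemma. □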

Now everything is in place for the

Proof of Theorem 2.1. The boundedness of the trace in (15) for $l = 0$ is implied
by Lemma 2.4, and its non-negativity follows from (25). For $l \ge 1$ we use
Lemma 2.7 and note that

(35)   $\operatorname{tr}_l [C(w_l)]_-^2 \le \mathrm{const}\; l^{-2}$,   $\operatorname{tr}_l [C(w_l)]_- \le \mathrm{const}$

by Proposition 2.2. □

3. Proof of the Main Results: Renormalization of the Relativistic Operator

The strategy for our proof of the main results is to use the Schrödinger operator
as a regularization for the relativistic problem, i.e., we will use it to eliminate the
main contribution to the energy, the Thomas-Fermi energy, and to focus only
on the energy shift of the low lying states where the electron-electron interaction
plays no role and the unscreened problem remains.

Recall that Theorem 1.1 for $q = 1$ reads

(36)   $\lim_{Z\to\infty} \frac{E^S(Z) - E^C_\kappa(Z)}{Z^2} = s(\kappa)$.

We will show this claim in two steps, namely that the upper limit and the lower
limit exist and are given by the same expression, namely the coefficient of the
$Z^2$-correction claimed in the theorem. That this coefficient is finite was already
remarked after Theorem 2.1.

3.1. Upper Bound on the Energy Difference / Lower Bound on the Relativistic
Energy. Lieb and Simon [36] showed that the Thomas-Fermi minimization
problem (8) has a unique minimizer $\rho_Z$, the Thomas-Fermi density. It fulfills the
scaling relation

(37)   $\rho_Z(x) = Z^2 \rho_1(Z^{1/3} x)$.

We define the radius of the Thomas-Fermi exchange hole at the point $x \in \mathbb{R}^3$ as the
unique minimal radius $R_Z(x)$ for which

(38)   $\int_{|x-y| \le R_Z(x)} \rho_Z(y)\, dy = \tfrac12$.

We denote the exchange-hole-reduced Thomas-Fermi screening potential by

(39)   $\chi_{TF}(x) := \int_{|x-y| \ge R_Z(x)} \frac{\rho_Z(y)}{|x-y|}\, dy$

and the corresponding one-particle operators by

(40)   $S_{TF} := S(Z|x|^{-1} - \chi_{TF})$,

(41)   $C_{TF} := C_c(Z|x|^{-1} - \chi_{TF})$,

both self-adjointly realized in $L^2(\mathbb{R}^3)$. Here we use a notation similar to that in
Section 2,

(42)   $C_c(v) := \sqrt{c^2p^2 + c^4} - c^2 - v$.

We remark that we slightly deviate from the more usual choice $Z|x|^{-1} - \rho_Z * |\cdot|^{-1}(x)$
for the screened potential. This is motivated by the correlation inequality (44)
below. The concept of an exchange hole can be traced back to Slater [54]. It also
has been used to estimate the exchange-correlation energy (Lieb [32], Lieb and
Oxford [35]).

We shall express the many-particle ground-state energy in terms of quantities
involving the above one-particle operators. In the relativistic case we use the
correlation inequality of [39] to obtain a lower bound on the many-particle ground-state
energy.

Lemma 3.1. For all $L \in \mathbb{N}$,

(43)   $E^C_\kappa(Z) \ge -\sum_{l=0}^{L-1} \operatorname{tr}_l [C_c(Z|x|^{-1})]_- - \sum_{l=L}^{\infty} \operatorname{tr}_l [C_{TF}]_- - D(\rho_Z, \rho_Z)$.

Proof. We use the correlation inequality [39, Equation (14)]

(44)   $\sum_{1\le\mu<\nu\le N} |x_\mu - x_\nu|^{-1} \ge \sum_{\nu=1}^{N} \chi_{TF}(x_\nu) - D(\rho_Z, \rho_Z)$,

to bound $E^C_\kappa(Z)$ from below by the ground-state energy of

      $\sum_{\nu=1}^{N} (C_{TF})_\nu - D(\rho_Z, \rho_Z)$.

This yields

(45)   $E^C_\kappa(Z) \ge -\operatorname{tr} [C_{TF}]_- - D(\rho_Z, \rho_Z)$.

We split the trace according to angular momentum and use the operator inequality
$C_{TF} \ge C_c(Z|x|^{-1})$ for all $l \le L-1$ to obtain the assertion. □

In the non-relativistic case, we recall

Proposition 3.2. Let $L := [Z^{1/9}]$. Then, as $Z \to \infty$,

(46)   $E^S(Z) = -\sum_{l=0}^{L-1} \operatorname{tr}_l [S(Z|x|^{-1})]_- - \sum_{l=L}^{\infty} \operatorname{tr}_l [S_{TF}]_- - D(\rho_Z, \rho_Z) + O(Z^{47/24})$.

Proof. The same argument as in Lemma 3.1 yields the lower bound

(47)   $E^S(Z) \ge -\sum_{l=0}^{L-1} \operatorname{tr}_l [S(Z|x|^{-1})]_- - \sum_{l=L}^{\infty} \operatorname{tr}_l [S_{TF}]_- - D(\rho_Z, \rho_Z)$.

Note that $\chi_{TF} \le \rho_Z * |\cdot|^{-1}$. Hence [51, Theorem 1] and the proof of this
theorem (in particular, [51, Lemma 2], see also [52]) show that one can further
estimate

(48)   $-\sum_{l=0}^{L-1} \operatorname{tr}_l [S(Z|x|^{-1})]_- - \sum_{l=L}^{\infty} \operatorname{tr}_l [S_{TF}]_- - D(\rho_Z, \rho_Z) \ge E_{TF}(Z) + \tfrac14 Z^2 - \mathrm{const}\; Z^{17/9} \log Z$.

On the other hand, one has the upper bound [48, Lemmas 3.1 and 4.1]

(49)   $E^S(Z) \le E_{TF}(Z) + \tfrac14 Z^2 + \mathrm{const}\; Z^{47/24}$.

Combining this with (47) and (48) we obtain the assertion. □

Proof of Theorem 1.1 - first part. Choosing $L = [Z^{1/9}]$ and combining Lemma 3.1
and Proposition 3.2 we obtain

(50)   $E^S(Z) - E^C_\kappa(Z) \le -\sum_{l=0}^{L-1} \operatorname{tr}_l \big( [S(Z|x|^{-1})]_- - [C_c(Z|x|^{-1})]_- \big) - \sum_{l=L}^{\infty} \operatorname{tr}_l \big( [S_{TF}]_- - [C_{TF}]_- \big) + \mathrm{const}\; Z^{47/24}$.

We note that by scaling $x \mapsto x/c$, the operators $S_{TF}$ and $C_{TF}$ are unitarily equivalent
to the operators $\kappa^{-2}Z^2\, S(\kappa|x|^{-1} - \chi_Z)$ and $\kappa^{-2}Z^2\, C_1(\kappa|x|^{-1} - \chi_Z)$, both acting
in $L^2(\mathbb{R}^3)$, where

(51)   $\chi_Z(x) := \kappa^2 Z^{-2} \chi_{TF}(\kappa x/Z)$.
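This implies

(52)   $\limsup_{Z\to\infty} \frac{E^S(Z) - E^C_\kappa(Z)}{Z^2} \le \kappa^{-2} \limsup_{Z\to\infty} \big( \Sigma_1(Z) + \Sigma_2(Z) \big)$

where

      $\Sigma_1(Z) := \sum_{l=0}^{L-1} \operatorname{tr}_l \big( [C_1(\kappa|x|^{-1})]_- - [S(\kappa|x|^{-1})]_- \big)$,

      $\Sigma_2(Z) := \sum_{l=L}^{\infty} \operatorname{tr}_l \big( [C_1(\kappa|x|^{-1} - \chi_Z)]_- - [S(\kappa|x|^{-1} - \chi_Z)]_- \big)$.

Theorem 2.1 implies that the summands in both sums on the right-hand side are
non-negative and bounded by const $(l+1)^{-2}$ independently of $Z$. Therefore the
first sum actually converges,

(53)   $\limsup_{Z\to\infty} \Sigma_1(Z) = \sum_{l=0}^{\infty} \operatorname{tr}_l \big( [C_1(\kappa|x|^{-1})]_- - [S(\kappa|x|^{-1})]_- \big)$.

Moreover, the second sum converges to zero,

(54)   $\limsup_{Z\to\infty} \Sigma_2(Z) \le \mathrm{const} \limsup_{L\to\infty} \sum_{l=L}^{\infty} (l+1)^{-2} = 0$.

This concludes the proof of the upper bound on the energy difference. □

We remark that $\Sigma_2(Z) \le \mathrm{const}\; Z^{-1/9}$, hence we have actually shown that

(55)   $E^S(Z) - E^C_\kappa(Z) \le s(\kappa) Z^2 + \mathrm{const}\; Z^{47/24}$.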

3.2. Lower Bound on the Energy Difference / Upper Bound on the Relativistic
Energy. Following [48] we define two one-particle density matrices $d^S$
and $d^C$ as sums

(56)   $d^{\#} := \sum_{l=0}^{\infty} d_l^{\#}$.

As above, we use the convention that $\#$ refers either to the Schrödinger case or
to the Chandrasekhar case. The operators $d_l^{\#}$ are defined in $L^2(\mathbb{R}^3)$ through their
integral kernels

(57)   $d_l^{\#}(x,y) := \sum_{n=1}^{\infty} \sum_{m=-l}^{l} w_{n,l}\, \psi^{\#}_{n,l,m}(x)\, \overline{\psi^{\#}_{n,l,m}(y)}$.

The weights $w_{n,l}$ and the functions $\psi^{\#}_{n,l,m}$ are defined separately for angular
momentum $l < L$ and $l \ge L$, where $L$ will be chosen later in a $Z$-dependent way.

Case $l < L$: We define $\psi^{\#}_{n,l,m}$ as the $n$-th eigenfunction of $S(Z|x|^{-1})$
restricted to angular momentum $(l,m)$, respectively of $C_c(Z|x|^{-1})$ restricted
to angular momentum $(l,m)$, with the normalization $\|\psi^{\#}_{n,l,m}\|^2 = 1$. Note
that this function is of the form $\psi^{\#}_{n,l,m}(x) = \varphi^{\#}_{n,l}(|x|)\, Y_{l,m}(x/|x|)$ with a
radial function $\varphi^{\#}_{n,l}$. The weights $w_{n,l}$ are defined independently of $m$ by

(58)   $w_{n,l} := \begin{cases} 1 & n \le K - l, \\ 0 & n > K - l, \end{cases}$

where $K := [dZ^{1/3}]$ with $d$ some positive constant independent of $Z$.

Case $l \ge L$: We choose $\psi_{n,l,m}(x) = \varphi_{n,l}(|x|)\, Y_{l,m}(x/|x|)$ where the functions
$\varphi_{n,l}$, as well as the weights $w_{n,l}$, are defined exactly as in [48, Section 2]
independently of $\#$. (The exact form of the functions and the values of the
weights for $l \ge L$ are irrelevant in our context.)

Note that the above construction guarantees $d^{\#}$ to be density matrices, i.e.,
$0 \le d^{\#} \le 1$. Moreover, by the choice of $L$, $K$, and $w_{n,l}$ one can assure that $\operatorname{tr} d^{\#} \le Z$.
(For $\# = S$ this is proved in [48, Corollary 4.1], and follows hence also for $\# = C$.)

Since $d_l^{\#}$ is independent of $\#$ for $l \ge L$ we drop the superscript in this case.
Moreover, we shall use the notations

      $d_<^{\#} := \sum_{l=0}^{L-1} d_l^{\#}$,   $d_> := \sum_{l=L}^{\infty} d_l$,

and

      $\rho_l^{\#}(x) := d_l^{\#}(x,x)$,   $\rho_<^{\#}(x) := d_<^{\#}(x,x)$,   $\rho_>(x) := d_>(x,x)$.

We recall now that the density matrix $d^S$ gives an energy which is correct up to
the order we are interested in. More precisely, one has

Proposition 3.3. Let $L := [Z^{1/12}]$. Then, for sufficiently large $Z$,

(59)   $E^S(Z) = \operatorname{tr}[S(Z|x|^{-1}) d^S] + D(\rho^S, \rho^S) + O(Z^{47/24})$.

Proof. It is shown in [48] that for sufficiently large $Z$,

      $E^S(Z) \le \operatorname{tr}[S(Z|x|^{-1}) d^S] + D(\rho^S, \rho^S) \le E_{TF}(Z) + \tfrac14 Z^2 + \mathrm{const}\; Z^{47/24}$.

Combining this with the lower bound on $E^S(Z)$ which was recalled in (47) and
(48), we obtain the assertion. □

We obtain a further lower bound on the ground state energy by dropping a part of
the Coulomb energy,

(60)   $E^S(Z) \ge \operatorname{tr}[S(Z|x|^{-1}) d_<^S] + \operatorname{tr}[S(Z|x|^{-1}) d_>] + D(\rho_>, \rho_>) - \mathrm{const}\; Z^{47/24}$.

For an upper bound in the relativistic case we employ a variational principle to
obtain

Lemma 3.4. For sufficiently large $Z$

      $E^C_\kappa(Z) \le \operatorname{tr}[C_c(Z/|x|) d_<^C] + \operatorname{tr}[S(Z/|x|) d_>] + D(\rho_>, \rho_>) + 2D(\rho_<^C, \rho_>) + D(\rho_<^C, \rho_<^C)$.

Proof. As noted above, $d^C$ satisfies $0 \le d^C \le 1$ and $\operatorname{tr} d^C \le Z$ for sufficiently large
$Z$ [48, Corollary 4.1]. Using that the Hartree-Fock functional bounds the ground
state energy from above even if non-idempotent density matrices are inserted, a
fact that was proven by Lieb [33] (see also Bach [3]), and estimating the indirect
part of the Coulomb energy by zero we obtain

(61)   $E^C_\kappa(Z) \le \operatorname{tr}[C_c(Z|x|^{-1}) d^C] + D(\rho^C, \rho^C)$.

Both terms on the right-hand side are split according to $d^C = d_<^C + d_>$. To obtain
the desired upper bound we use the inequality $\tfrac12 p^2 \ge \sqrt{c^2p^2 + c^4} - c^2$ for large
angular momenta. □

The following lemma shows the irrelevance of the interaction energy of the low
lying states with all other electrons (including themselves). The proof follows the
strategy pursued in [48], namely to estimate it by the lowest Coulomb energy of a
particle in the field of an external point charge $Z$, and then simply multiplying by
the particle number. There is, however, one important change in the channel $l = 0$.
Because of the singular nature of the lowest eigenfunctions, their expectations in
potentials with Coulomb singularities do not exist. To circumvent this problem
we use the Hardy-Littlewood-Sobolev inequality followed by a recent Sobolev-type
inequality [24].

Lemma 3.5. One has $D(\rho_<^C, \rho^C) \le \mathrm{const}\; Z^{11/6} \log Z$.

Proof. We treat the terms $D(\rho_<^C, \rho_<^C)$ and $D(\rho_<^C, \rho_>)$ separately. For the latter one
we recall that

(62)   $\int \rho_l^{\#}(x)\, dx = (2l+1)(K-l)$,   $0 \le l < L$,

where $K = O(Z^{1/3})$, and that by Proposition 3.4 in [48]

(63)   $\sum_{l=L}^{\infty} \int \frac{\rho_l(x)}{|x|}\, dx \le \mathrm{const}\; Z^{4/3}$.

The densities $\rho_l^{\#}$ are spherically symmetric because of the addition formula for the
spherical harmonics. Hence, using Newton's theorem [41], we have

(64)   $D(\rho_<^C, \rho_>) \le \frac12 \sum_{l=0}^{L-1} \int \rho_l^C(x)\, dx \int \frac{\rho_>(y)}{|y|}\, dy \le \mathrm{const} \sum_{l=0}^{L-1} (2l+1)(K-l) Z^{4/3} \le O(L^2 K Z^{4/3}) = O(Z^{11/6})$.

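We set $\rho_{<>}^C := \rho_<^C - \rho_0^C$ and estimate

(65)   $D(\rho_<^C, \rho_<^C) \le 2D(\rho_0^C, \rho_0^C) + 2D(\rho_{<>}^C, \rho_{<>}^C)$.

This allows us to treat the contributions from $l = 0$ and $1 \le l < L$ separately. Using a
scaled version of Lemma 2.6 with $R_l := \big( (l+\tfrac12)^2 - 4\kappa^2 \big)/(4\kappa)$ we obtain for $1 \le l < L$

      $\operatorname{tr}(|x|^{-1} d_l^C) \le \frac{1}{2Z} \operatorname{tr}[C_c(0) d_l^C] + \operatorname{tr}\big( \chi_{\{|x| > R_l/c\}} |x|^{-1} d_l^C \big)$
      $\le \frac12 \operatorname{tr}(|x|^{-1} d_l^C) + \frac{c}{R_l} \operatorname{tr} d_l^C$,

where the last inequality used the fact that eigenfunctions of $d_l^C$ are also eigenfunctions
of $C_c(Z|x|^{-1})$ with negative eigenvalue. Hence, summing over $l$ and noting
that $R_l^{-1} \le \mathrm{const}\; l^{-2}$,

      $\int \frac{\rho_{<>}^C(x)}{|x|}\, dx = \sum_{l=1}^{L-1} \operatorname{tr}(|x|^{-1} d_l^C) \le 2c \sum_{l=1}^{L-1} R_l^{-1} \operatorname{tr} d_l^C \le \mathrm{const}\; Z \sum_{l=1}^{L-1} l^{-2} \int \rho_l^C(x)\, dx$.

Thus by (62) and again by Newton's theorem

      $D(\rho_{<>}^C, \rho_{<>}^C) \le \frac12 \int \rho_{<>}^C(x)\, dx \int \frac{\rho_{<>}^C(y)}{|y|}\, dy \le \mathrm{const}\; K L^2 \cdot K Z \log L \le \mathrm{const}\; Z^{11/6} \log Z$.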

Finally, we treat the term corresponding to $l = 0$. By the Hardy-Littlewood-Sobolev
inequality (cf. [34]) and by Hölder's inequality

      $D(\rho_0^C, \rho_0^C) \le \mathrm{const} \Big( \int \Big( \sum_{n=1}^{K} |\psi^C_{n,0,0}(x)|^2 \Big)^{6/5} dx \Big)^{5/3} \le \mathrm{const}\; K^{1/3} \Big( \sum_{n=1}^{K} \int |\psi^C_{n,0,0}(x)|^{12/5}\, dx \Big)^{5/3}$.

Now we use the Sobolev-type inequality [24, Eq. (2.8)]

(66)   $\|u\|_{12/5}^2 \le \mathrm{const}\; \big( u, (|p| - \tfrac{2}{\pi}|x|^{-1}) u \big)^{1/2} \|u\|$

where the first factor on the right-hand side is to be understood in form sense. Using
that $|p| - \tfrac{2}{\pi}|x|^{-1} \le c^{-1} C_c(Z|x|^{-1}) + c$ and that $\psi^C_{n,0,0}$ is a normalized eigenfunction
of $C_c(Z|x|^{-1})$ we deduce

(67)   $\|\psi^C_{n,0,0}\|_{12/5} \le \mathrm{const}\; c^{1/4}$.

Combining the previous relations we arrive at

(68)   $D(\rho_0^C, \rho_0^C) \le \mathrm{const}\; K^{1/3} (K c^{3/5})^{5/3} \le \mathrm{const}\; Z^{5/3}$.

This completes the proof of the lemma. □
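The powers of $Z$ in the proof of Lemma 3.5 can be tracked mechanically. The Python sketch below does the bookkeeping with exact fractions, assuming the growth rates used in this section: $c = Z/\kappa$ with $\kappa$ fixed, $K \sim Z^{1/3}$, and $L \sim Z^{1/12}$ as in Proposition 3.3. Logarithmic factors and constants are ignored, and all names are illustrative.

```python
from fractions import Fraction

# Growth exponents in Z (assumed as in Section 3.2): c = Z/kappa with kappa
# fixed, K ~ Z^(1/3), L ~ Z^(1/12).
EXPONENT = {
    "Z": Fraction(1),
    "c": Fraction(1),
    "K": Fraction(1, 3),
    "L": Fraction(1, 12),
}


def z_exponent(**powers):
    """Exponent of Z for a monomial such as L^2 * K * Z^(4/3).

    Keyword arguments map a symbol name to its power; logarithms are ignored.
    """
    return sum(
        (EXPONENT[name] * Fraction(p) for name, p in powers.items()),
        Fraction(0),
    )


# Estimate (64): L^2 * K * Z^(4/3)
cross_term = z_exponent(L=2, K=1, Z=Fraction(4, 3))
# Bound on D(rho_<>, rho_<>): K * L^2 * K * Z
higher_channels = z_exponent(K=2, L=2, Z=1)
# Estimate (68): K^(1/3) * (K * c^(3/5))^(5/3) = K^2 * c
s_channel = z_exponent(K=2, c=1)
```

Both the cross term and the channels $1 \le l < L$ give the exponent $11/6$, while the channel $l = 0$ gives $5/3$; all of these are below 2, so the corresponding energies do not affect the $Z^2$ coefficient.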
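Proof of Theorem 1.1 - second part. It follows from Lemma 3.5 that

      $2D(\rho_<^C, \rho_>) + D(\rho_<^C, \rho_<^C) = O(Z^{11/6} \log Z)$.

Hence Lemma 3.4 together with (60) implies

      $\liminf_{Z\to\infty} Z^{-2} [E^S(Z) - E^C_\kappa(Z)]$
      $\ge \liminf_{Z\to\infty} Z^{-2} \big\{ \operatorname{tr}[S(Z|x|^{-1}) d_<^S] - \operatorname{tr}[C_c(Z|x|^{-1}) d_<^C] \big\}$
      $= \liminf_{Z\to\infty} Z^{-2} \sum_{l=0}^{L-1} (2l+1) \sum_{n=1}^{K-l} \big[ (\psi^S_{n,l,m}, S(Z|x|^{-1}) \psi^S_{n,l,m}) - (\psi^C_{n,l,m}, C_c(Z|x|^{-1}) \psi^C_{n,l,m}) \big]$.

The claim now follows from the scaling $x \mapsto x/c$ and Fatou's lemma. □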

In order to get an explicit remainder estimate one could bound the sum

(69)   $\sum_{l=0}^{L-1} (2l+1) \sum_{n > K-l} \big[ (\psi^S_{n,l,m}, S(Z|x|^{-1}) \psi^S_{n,l,m}) - (\psi^C_{n,l,m}, C_c(Z|x|^{-1}) \psi^C_{n,l,m}) \big]$

from above. This is certainly not difficult but for brevity we refrain from doing so.
The sum corresponding to $l \ge L$ can be bounded using Theorem 2.1.